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Background: Delirium is common in the early stages of hospitalization for a variety of acute 
and chronic diseases. 

Objectives: To evaluate the diagnostic accuracy of two delirium screening tools, the Confusion 
Assessment Method (CAM) and the Confusion Assessment Method for the Intensive Care Unit 
(CAM-ICU). 

Methods: We searched MEDLINE, EMBASE, and Psychlnfo for relevant articles published 
in English up to March 2013. We compared two screening tools to Diagnostic and Statistical 
Manual of Mental Disorders IV criteria. Two reviewers independently assessed studies to 
determine their eligibility, validity, and quality. Sensitivity and specificity were calculated 
using a bivariate model. 

Results: Twenty-two studies (n = 2,442 patients) met the inclusion criteria. All studies dem- 
onstrated that these two scales can be administered within ten minutes, by trained clinical or 
research staff. The pooled sensitivities and specificity for CAM were 82% (95% confidence 
interval [CI]: 69%-91%) and 99% (95% CI: 87%-100%), and 81% (95% CI: 57%-93%) and 
98% (95% CI: 86%-100%) for CAM-ICU, respectively. 

Conclusion: Both CAM and CAM-ICU are validated instruments for the diagnosis of delirium 
in a variety of medical settings. However, CAM and CAM-ICU both present higher specificity 
than sensitivity. Therefore, the use of these tools should not replace clinical judgment. 
Keywords: confusion assessment method, diagnostic accuracy, delirium, systematic review, 
meta-analysis 

Introduction 

Delirium is characterized by acute onset of an altered level of consciousness with 
fluctuating levels of orientation, memory, thought, and/or behavior. 1 It is commonly 
observed among patients with an acute medical condition, especially patients in the 
internal medicine, neurology, psychiatry, and geriatric wards. Delirium has been asso- 
ciated with unfavorable outcomes including higher mortality, longer hospitalization, 
and a greater degree of dependence after discharge. 2 Therefore, early recognition and 
prevention of delirium may improve outcomes in hospitalized patients. 

Currently, two standard diagnostic criteria on delirium are the Diagnostic 
and Statistical Manual of Mental Disorders (DSM) IV criteria 3 and International 
Classification of Diseases- 10. 4 However, neither of these diagnostic tools can be 
easily applied to daily bedside practice. Therefore, a variety of screening tools such 
as the Confusion Assessment Method (CAM), 5 the Delirium Rating Scale, 6 the 
Intensive Care Delirium Screening Checklist, 7 and the Nursing Delirium Screening 
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Scale 8 have been developed. Among them, CAM is one of 
the most widely used diagnostic instruments for clinical and 
research purposes with proven psychometric properties. 9 
It was developed by Inouye et al based on DSM III for 
the purpose of enabling nonpsychiatric trained clinicians 
to identify delirium. 510 Ely et al developed the Confusion 
Assessment Method for the Intensive Care Unit (CAM- 
ICU), 11,12 an instrument specifically designed to assess 
nonverbal patients (ie, mechanically ventilated) based 
on the CAM algorithm. The objective of this study was 
to investigate whether a nonlanguage based method has 
impact on diagnostic accuracy of the scale and CAM-ICU 
performance in identifying delirium in a different spectrum 
of patients in comparison to CAM. Therefore, in this review, 
we compared CAM and CAM-ICU to DSM iy a golden 
standard for delirium diagnosis applied to verbal versus 
nonverbal patients. 

Materials and methods 

Search sources and searches 

Two reviewers (QS and LW) conducted a literature search 
of MEDLINE, EMBASE, and Psychlnfo in March 2013. 
Computer searches based on keywords were conducted. 
References from previously retrieved articles were also 
searched. Details of search strategy can be found in 
Appendix 1. 

Study selection 

Research articles examining either CAM or CAM-ICU 
were included for review if they met the following inclu- 
sion criteria: (1) the study was designed as an observational, 
cross-sectional or case series study; (2) the reference test was 
DSM IV for delirium diagnosis; (3) diagnostic accuracy 
estimates were reported in the paper, including sensitivity, 
specificity, true positive, false positive, true negative, and 
false negative, or sufficient detail to derive these numbers; 
and (4) the study was written in English. 

We excluded papers on prevalence of delirium without 
diagnostic accuracy data, reference test other than DSM IV, 
or if the definition of delirium was unclear in the original 
paper. Figure 1 shows a flow chart of the search. 

Data extraction and quality assessment 

Data were extracted to a form which included the following 
information: first author, year of publication, study popu- 
lation characteristics, name of tool, assessor of screening 
and reference test, diagnostic cut point used, and length of 
administration. 



Quality Assessment of Diagnostic Accuracy Studies 
(QUADAS-2) guidelines 13 were employed for this system- 
atic review to assess the study quality. For the purposes of 
this systematic review, we decided that a low risk of bias 
was assumed if all the questions were answered "yes." If an 
answer was either "no" or "unclear," a high risk of bias or 
"unclear" was assigned to the corresponding domain. In the 
patient selection domain, we considered the studies which 
included conditions related to mental disorders (ie, demen- 
tia, psychosis) which mimic delirium as an "appropriate 
inclusion" because we believe a good scale can discriminate 
between people who have delirium and people who have 
other mental disorders. For the index domain, we considered 
that a threshold was prespecified in translation versions. In 
domain 4, flow and timing, we considered 3 hours between 
the index test and reference standard as a reasonable time 
interval due to the fluctuations of delirium presentation. We 
appraised other items following guidelines. 13 For the appli- 
cability domain, studies were considered to be "high risk of 
bias" if no explicit description of inclusion and exclusion 
criteria were reported. 

All data extraction and quality assessment were con- 
ducted by two reviewers (QS and LW). Discrepancies were 
resolved by discussion. A third reviewer (GS) was consulted 
if discrepancies remained. 

Data synthesis and analysis 

The estimates for sensitivity, specificity, and negative and 
positive likelihood ratios were computed using bivariate 
models 14 and compared to the Moses-Littenberg method. 15 We 
used a bivariate model which preserves the two-dimensional 
nature of sensitivity and specificity and treats these two esti- 
mates as a paired index, thus accounting for the possibility of 
correlation which other methods do not address. In contrast to 
a traditional funnel plot, which uses straight lines for pooled 
estimates, the bivariate model produces an ellipse with a 
pooled mean sensitivity/specificity, 95% confidence interval, 
and 95% prediction ellipse. As both the bivariate model and 
the Moses-Littenberg method require a 2 x 2 data table, in 
this meta-analysis, we either retrieved the data directly from 
the study or generated numbers 16 based on the information 
presented in the original paper. Bivariate analyses were per- 
formed with PROC NLMixed with SAS (version 9.2, SAS 
Institute Inc, Cary, NC, USA) and figures were plotted using 
R (version 2.14, Lucent Technologies, Murray Hill, NJ, USA). 
We also built a summary receiver operating characteristic 
(SROC) curve using RevMan. 17 Statistical significance was 
set a priori at an alpha level of 0.05. 
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Search results combined (MEDLINE, EMBASE, 
and Psychlnfo) n = 2068 





Duplicate 


► 


n = 564 



Articles screened on basis of title and 
abstract n = 1 504 



Manuscript review with application of 
inclusion and exclusion criteria n = 84 



Excluded n = 1420 



Not exam screen scale: n = 384 



Not diagnostic study: n = 298 



Study scale does not compare with 
DSM-IV: n = 34 



Outcome is not delirium: n = 496 



Pediatric population: n = 45 



Review: n = 140 



Commentary: n = 23 



Studies included in 
systematic review n = 22 



Excluded n = 66 



Not exam screen scale: n = 14 



Not diagnostic study: n = 12 



Study scale does not compare with 
DSM-IV: n = 6 



Outcome is not delirium: n = 29 



Pediatric population: n = 5 



Figure I Result of literature search. 

Abbreviation: DSM-IV, Diagnostic and Statistical Manual of Mental Disorders IV. 



Results 

Characteristics of studies 

A total of 22 studies 811,12 ' 18-36 were identified for this sys- 
tematic review (Table 1). Nine studies examined CAM 
(n = 1,033) while 13 assessed for CAM-ICU (n = 1,409). 
Overall, the mean age of patients included in these studies 
ranged from 54 to 85 years old. The percentage of patients 
diagnosed with delirium varied from 14% 25 to 87%. 12 The 
most common study populations were ventilated intensive 
care unit patients, 11 ' 12 geriatric inpatients, 19,21 ' 22 or postop- 
erative patients. 8 - 25 - 26 Most studies examined the accuracy of 
delirium diagnoses made by nondelirium health profession- 
als such as general practitioners, nurses, or trained research 
assistants compared to delirium experts such as psychiatrists 



or geriatricians. In these studies, both CAM and CAM-ICU 
could be administered as a quick questionnaire. A number of 
studies 18 ' 21 ' 27 ' 29 focused on delirium which developed in the 
early stage of an acute disease while others monitored the 
patients from admission to discharge. 1112,24 

Study quality 

There were no disagreements in quality assessment between 
the reviewers that affected the categorization of studies 
as high or low risk of bias (Table 3). All 22 studies were 
considered to have a low risk of bias with respect to the 
reference test part. However, 13 studies had a high risk of 
bias for the patient selection criteria due to the inappropriate 
exclusion of demented or psychotic patients who were easily 



Neuropsychiatric Disease and Treatment 20 1 3:9 



submit your manuscript | www.dovepress.com 
Dovepress 



1361 



Shi et al 



Dovepress 



o ss 

= I 

T3 s. 
11 



t V 

rt N 
1/1 "7, 



0) 



in J" 

2 » a 



0 -M 



u) O 

« S 

*■ rt 

t- M L 

O c *» 

- •= .!£ 

■s = = 

E » t 



J) t 



c 
o 

« "is 

V — 

00 3 

h °- 

rt a 

I- Q. 



oo — 



m — 



* .2 



* E 



0_ C DO 0_ DO D_ 



O 

Z 



y 

z 
< 
0 



z> 
u 



o 

" D 

£ y 

g z 

= < 

CO "o 



> -a 

> a 



o 
Z 



(J 



U 6 iE 



> y 

8 z 

U o 



D 

u 



u 

oo 

LU 

£ 9 

< -3 
u z 



E c 



A 0 



J) fi L 

"5 ■= 'i^ 



tu O 

DO 00 

S I 

ft) 



5f 2 



U 

LU 

£ 9 

< -3 
u z 



Q- _ rt <L> TJ _ 



Z 
< 

u 



y 

z 
< 

0 



£ £ 



=> U 

y is 

z 9 

< -5 

u z 



D 

u 



o 
Z 



£ U 



z 
< 
u 



o 

z 



o 
Z 



U 

LU 

£ 9 

< -3 
u z 



.2 D 

e u 

z 
< 

u 



— z 
e < 

$ 0 O "5 



5 u 



3 1 
•M, .1 y 



o 



o 

I 



Jz U 



o 



13 J 



=j o 



| submit your manuscript 

Dovepress 



Neuropsychiatric Disease and Treatment 20 1 3:9 



Dovepress 



Review and meta-analysis of confusion assessment method 



— NO 



— in 



— in — 



r~ o 



0 
Z 



D 
U 



1 5 



D 








o 


D 




y 


z> 


0 
l/l 






y 


D 


z 


y 


1- 
CD 


Z 


> 


z 


y 


of CA 


CAM- 


Thai v 


of CA 


Greek 


of CA 


CAM- 



E 

e -a 



o 
Z 



o 
Z 



o 
Z 



z 
< 
u 



z 
< 



> y 

° z 

Irt ^- 

0) < 

■g u 



-o 3 

a u 





ted 


ours 


acute 




general 


















E 


■r 


t-< 




c 








CN 


S 








t-J 


rt 


Al 










en 


'£> 


D 


en 


CD 


en 


tal 


rt 


at 


U 


rt 


n 




'q. 


Inp 


Pat 


to 


Inp 


stn 


Inp 


ho; 



D_ 00 

~ Al 



c =1 
cu U 



o 



o 

QJ 



o 



(J 



E 

rt 
< 



i Di - i 1 Al Ql 



U aL J= 



o 

r- 



y 

z 
< 
u 



D 

U 



-S o 

w LU 

.E Q 



i z" 



< u 



confused with delirium patients. One quarter of studies did 
not explain whether the assessor was blinded either to the 
reference test or index test. Therefore, these potentially 
unblinded studies were assigned as "high risk" for bias. 
Approximately 20% of studies scored as unclear or a high 
risk of bias because the interval time between the two tests 
exceeded 3 hours. Most studies had high applicability. Only 
three studies had a high risk of bias for the patient selection 
criteria as a result of vague descriptions of inclusion and 
exclusion criteria. 

Test accuracy of CAM and CAM-ICU 

The psychometric properties of delirium screening tests 
are summarized in Table 2. Overall, CAM and CAM-ICU 
demonstrated similar performance characteristics when 
diagnosing delirium in both ventilated and nonventilated 
patients. The sensitivity of pooled CAM and CAM-ICU 
were 82% (95% confidence interval [CI]: 69%-91%) and 
81% (95% CI: 57%-93%), respectively. The specificity of 
the two scales were 99% (95% CI: 87%-100%) and 98% 
(95% CI: 86%-100%), respectively. Figure 2 shows the 
bivariate summary estimates of sensitivity and specificity 
for CAM and CAM-ICU diagnostic accuracy. It also 
contains the 95% CI and prediction ellipses. The red dot 
shows the mean estimate of CAM-ICU, which was very 
close to CAM although the sensitivity was slightly lower. 
The solid line represents the 95% CI. We observed that 
CAM-ICU had a similar variance in comparison to CAM. 
The dotted lines represent the 95% prediction ellipses. 
As expected, CAM-ICU had a wider prediction range 
than CAM. 

Figure 3 shows the SROC curves for all 22 studies. Six 
studies were identified to deviate from the average ROC 
curves. Three studies had high specificity but low sensitivity 
while another three had moderate to high sensitivity and 
specificity. Otherwise, all other studies showed high val- 
ues for both sensitivity and specificity. The funnel plot 
(Figure 4) shows the distribution of sensitivity and speci- 
ficity across studies. A greater degree of homogeneity in 
specificity was observed across studies in comparison to 
sensitivity. 

Due to a lack of an adequate number of studies, we could 
not perform subgroup analysis to investigate potential factors, 
such as patient characteristics, type of disease, and difference 
in assessors influence on accuracy. On the other hand, CAM 
shows a good reliability across the studies, ranging from 86% 
to 94%. CAM-ICU demonstrated more variance in reliability, 
ranging from 20% to 96% (Table 2). 
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Table 2 Diagnostic accuracy of included screening test 



Test 



Sensitivity (%) 



Specificity (%) 



PPV (%) 



NPV (%) 



Reliability (%) 



CAM 

Fabbri 18 

Laurila 21 

Gonzalez" 

Radtke 25 

Leung 22 

Ryan 27 

Radtke 26 

Wongpakaran 2 ' 

Adamis 33 

Ryan 35 
Thomas 31 
Pooled CAM 
CAM-ICU 

Ely 1112 

Ely 1112 



Lin 23 

Luetz 8 
Guenther 32 

Gusmao-Flores 30 
Heo 20 

van Eijk 28 
Neufeld 34 
Mitasova 24 
Wang 36 

Pooled CAM-ICU 



94.1 

81.2 

90 

42.9 

76 

88.2 

74.9 

91.9 

Rater 1:87.5 
Rater 2: 79.2 
71 
74 

82 (69, 91) 

Nurse I: 100 
Nurse 2: 93 
Nurse I: 95 
Nurse 2: 96 
Intensivist: 100 
Rater 1:91 
Rater 2: 95 
81 

Rater 1 : 88 
Rater 2: 92 
72.5 

Nurse I: 89.8 

Nurse 2: 77.4 

46.7 

18 

76 

Nurse I: 91.8 
Nurse 2: 93.4 
81 (57, 93) 



96.4 
83.7 
100 
98.5 
100 
100 
100 
100 

Rater I: 91.5 
Rater 2: 87.2 
83 
100 

99 (87, 100) 

Nurse I: 98 
Nurse 2: 100 
Nurse I: 93 
Nurse 2: 93 
Intensivist: 89 
Rater 1 : 98 
Rater 2: 98 
96 

Rater I: 100 
Rater 2: 1 00 
96.2 

Nurse I: 72.4 

Nurse 2: 75.8 

98.1 

99 

98 

Nurse I: 90.8 
Nurse 2: 87.7 
98 (86, 100) 



84.2 
76.5 
100 
81.8 

100 
100 

Not reported 
100 

Rater 1 : 84 
Rater 2: 76 
75 
100 



Nurse I: 92 
Nurse 2: 100 
Not reported 



Rater 1:91 
Rater 2: 9 1 
Not reported 
Rater I: 100 
Rater 2: 100 
90.6 

Not reported 

94.6 

88 

91 

Nurse I: 90.3 
Nurse 2: 87.7 



98.7 
87.2 
96.9 
91.6 
92.6 
94.9 

Not reported 
90.6 

Rater 1 : 94 
Rater 2: 89 
80 

91 



Nurse I: 100 
Nurse 2: 98 
Not reported 



Rater 1 : 98 
Rater 2: 99 
Not reported 
Rater 1:91 
Rater 2: 94 
87.4 

Not reported 

72.2 

85 

94 

Nurse I: 92.2 
Nurse 2: 93.4 



86 

Not reported 
89 

Not reported 
94 

Not reported 

100 

91 

75 

Not reported 
Not reported 



96 

79-95 

91 

89 

Rater 1 : 94 
Rater 2: 96 
Not reported 
81 

63 
20 
94 
92 



Note: Figures in brackets 95% CI. 

Abbreviations: CAM, Confusion Assessment Method; CAM-ICU, Confusion Assessment Method for the Intensive Care Unit; NPV, negative predictive value; PPV, positive 
predictive value; CI, confidence interval. 



Discussion 

The diagnosis of delirium in different settings constitutes a 
challenge. Despite a variety of existing tools, limited infor- 
mation is available on the performance and psychometric 
properties of these tools. In the present systematic review and 
meta-analysis, we identified 22 papers with 2,449 patients 
which studied the accuracy of the two most commonly used 
tools (CAM and CAM-ICU) in the diagnosis of delirium 
over the past decade against the standard delirium diagnostic 
test (DSM-IV). 

CAM and CAM-ICU are tools which can be adminis- 
trated quickly, and have high sensitivity and specificity for 
early identification of delirium in a variety of hospitalized 
populations. 

This review builds on previous work by Wei et al 37 in 
2006. In addition to the incorporation of more recent studies, 



we also employed QUADAS-2 for quality assessment and 
compared two statistical analysis methods when examining 
the accuracy performance of CAM and CAM-ICU, which 
had not previously been done. Overall, CAM and CAM-ICU 
showed moderate to high sensitivity, high specificity, and 
moderate to high reliability. Both tests can be administrated 
by health professionals with appropriate training to obtain 
the reliable results. In contrast to Wei et al's finding, 37 we 
observed higher specificity for both CAM and CAM-ICU 
in comparison to the sensitivity. This may be a result of 
differences in statistical methodology. However, the results 
shown in this paper support Wei et al's recommendation that 
given the relatively low values of sensitivity (approximately 
80%-85%), clinicians should not base delirium diagnoses 
solely on either CAM or CAM-ICU assessments; rather they 
should use the tests in addition to their clinical judgment. 37 
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Table 3 Quality assessment of included papers 



Study 


Risk of bias 








Applicability 








Patient 


Index test 


Reference 


Flow and 


Patient 


Index test 


Reference 




selection 




standard 


timing 


selection 




standard 


Ely" 12 


High 


Low 


Low 


Unclear 


Low 


Low 


Low 


Ely 1112 


High 


Low 


Low 


Low 


Low 


Low 


Low 


Fabbri 18 


High 


Unclear 


Low 


Low 


High 


Low 


Low 


Laurila 21 


Low 


Low 


Low 


Low 


Low 


Low 


Low 


Lin 23 


High 


Low 


Low 


High 


Low 


Low 


Low 


Gonzalez 19 


High 


Low 


Low 


Low 


High 


Low 


Low 


Leung 22 


Low 


Low 


Low 


Unclear 


High 


Low 


Low 


Ryan 27 


Low 


Low 


Low 


Low 


Low 


Low 


Low 


Radtke 25 


High 


Unclear 


Unclear 


Low 


Low 


Low 


Low 


Radtke 26 


High 


Unclear 


Unclear 


Unclear 


Low 


Low 


Low 


Guenther 32 


High 


Low 


Low 


High 


Low 


Low 


Low 


Luetz 8 


High 


Low 


Low 


Low 


Low 


Low 


Low 


Wongpakaran 29 


Low 


Low 


Low 


Low 


Low 


Low 


Low 


van Eijk 28 


Low 


Low 


Low 


High 


Low 


Low 


Low 


Gusmao-Flores 30 


High 


Low 


Low 


Low 


Low 


Low 


Low 


Heo 20 


High 


Unclear 


Unclear 


Unclear 


Low 


Low 


Low 


Neufeld 34 


Low 


Low 


Low 


Low 


Low 


Low 


Low 


Adamis 33 


Low 


Low 


Low 


Low 


Low 


Low 


Low 


Mitasova 24 


High 


Low 


Low 


High 


Low 


Low 


Low 


Ryan 35 


Low 


Unclear 


Unclear 


High 


Low 


Low 


Low 


Thomas 31 


High 


Low 


Low 


High 


Low 


Low 


Low 


Wang 36 


Low 


Low 


Low 


Low 


Low 


Low 


Low 



It is not surprising that CAM and CAM-ICU demonstrated 
similar diagnostic accuracy as they are derived from the 
same algorithm. However, it is worth noticing that there 
are wider variances of sensitivity than specificity when 
applying these two screening scales. The CAM diagnostic 



o 




C 



d 



o _ 
o 

0.0 0.2 0.4 0.6 0.8 1.0 

1 -specificity 

Figure 2 Bivariate estimate of sensitivity and specificity. 

Notes: Bivariate summary estimates of sensitivity and specificity for delirium 
screening test with 95% confidence and prediction ellipses. Upper left dots represent 
sensitivity and specificity of the Confusion Assessment Method and the Confusion 
Assessment Method for the Intensive Care Unit. Smaller ellipses represent 95% 
confidence interval of sensitivity and specificity. Larger ellipses represent 95% 
confidence interval of prediction sensitivity and specificity. Red color represents 
the Confusion Assessment Method for the Intensive Care Unit and the black color 
represents the Confusion Assessment Method. 

Abbreviations: CAM, Confusion Assessment Method; CAM-ICU, Confusion 
Assessment Method for the Intensive Care Unit. 



algorithm is comprised of four components: (1) an acute 
onset of mental status changes of fluctuating course; 
(2) inattention; (3) disorganized thinking; and (4) an altered 
level of consciousness. The diagnosis of delirium is based on 
the presence of both component 1 and 2, and either 3 or 4. 
Missing any of these diagnostic criteria due to inadequate 
training in assessment may underestimate the percentage of 
delirium cases, especially hypoactive patients. Another pos- 
sible reason for the variance in sensitivity is the difference 
in the rate of sedative drugs included in the studies which 
affects the diagnostic accuracy of delirium. 38 

We also observed a wide range of reliability across 
studies, especially for CAM-ICU. It is plausible that 
sedatives 39,40 widely used in critical care units were causing 
the fluctuation in delirium. 1112 41 Therefore, it is important to 
adhere closely to the screening protocol with regard to the 
point of time assessment, observation period as well as to 
provide sufficient training to medical or research staff when 
conducting delirium assessments. 42,43 

To our knowledge, this is the first systematic review and 
meta-analysis to evaluate CAM and CAM-ICU in comparison 
to DSM-IV using the bivariate model and Moses-Littenberg 
method. We appraised all studies with recently revised QUA- 
DAS-2 to assess quality. However, in the current review, we 
are only able to compare two of the most popular delirium 
screening tools. Further studies may be warranted to expand 
these findings. 
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Sensitivity 

1-i-B 




Figure 3 Summary receiver operating characteristic curve of delirium screen scales. 



Study 


TP 


FP 


FN 


TN 


Sensitivity 


Specificity 


Adamis 33 


21 


4 


3 


43 


0.88 [0.68, 


0.97] 


0.91 [0.80, 0.98] 


van Eijk 28 


35 


2 


40 


104 


0.47 [0.35, 


0.59] 


0.98 [0.93, 1.00] 


Ely 11 ' 12 


77 


0 


3 


16 


0.96 [0.89, 


0.99] 


1.00 [0.79, 1.00] 


Fabbri 18 


16 


3 


1 


80 


0.94 [0.71, 


1.00] 


0.96 [0.90, 0.99] 


Gonzalez 19 


27 


0 


3 


93 


0.90 [0.73, 


0.98] 


1.00 [0.96, 1.00] 


Guenther 32 


22 


0 


3 


19 


0.88 [0.69, 


0.97] 


1.00 [0.82, 1.00] 


Gusmao-Flores 30 


33 


3 


13 


70 


0.72 [0.57, 


0.84] 


0.96 [0.88, 0.99] 


Laurila 21 


26 


8 


6 


41 


0.81 [0.64, 


0.93] 
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Figure 4 Funnel plot of delirium screen scales. 

Abbreviations: TP, true positive; FP, false positive; FN, false negative; TN, true negative. 
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Conclusion 

Both CAM and CAM-ICU are validated instruments in 
the diagnosis of delirium in a variety of medical settings, 
including the emergency department, the postoperative 
recovery room, in palliative care, the stroke unit, and 
the rehabilitation unit. Health professionals with appropriate 
training can achieve similar accuracy to experts specializing 
in psychiatric evaluation. CAM and CAM-ICU both pres- 
ent higher specificity than sensitivity. Health professionals 
should be cautiously interpreting these results as superior 
sensitivity is expected to a screening scale. The incidence of 
delirium may be underestimated due to the relatively high 
rate of false negatives in a low sensitivity scale. Therefore, 
CAM and CAM-ICU instruments should not be relied on 
alone for diagnosis, but that application of clinical judgment 
(presumably based on application of DSM) is essential to 
diagnose the presence and severity of delirium. 
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Appendix I : Search stategy 

Database: Ovid MEDLINE 
Searched March 11, 2013 

1946 to March week 2 2013 

1 . exp Delirium/ 

2. deliri$.ti,ab. 

3. or/1-2 

Validated diagnosis filter 

4. (exp sensitivity/ and specificity/) or sensitiv*.ti,ab. 

5. *diagnosis/ or diagnos*.ti,ab. 

6. *Diagnostic Tests, Routine/ or diagnostic. mp. 

7. *diagnosis,differential/ 

8. di.fs. 

9. (Test or assessment or scale or checklist or instrument). mp. [mp = title, abstract, original title, name of substance word, 
subject heading word, protocol supplementary concept, rare disease supplementary concept, unique identifier] 

10. 3 and (or/4-8) and 9 

1 1. limit 10 to English language 

Embase Classic+Embase <1947 to 2013 week 11> 

Search Strategy: 

1 . exp delirium/ 

2. deliri$.ti,ab. 

3. 1 or 2 

4. exp "SENSITIVITY AND SPECIFICITY"/ 

5. sensitivity. tw. 

6. specificitytw. 

7. ((pre-test or pretest) adj probability). tw. 

8. post-test probability. tw. 

9. predictive valueS.tw. 

10. predictive valueS.tw. 

1 1 . *Diagnostic Accuracy/ 
12.3 and (or/4-11) 

PsycINFO 1806 to March week 2 2013 

Search Strategy: 

1 . (delirium or deliria).mp. 

2. exp sensitivity/ and specificity/) or sensitiv*.ti,ab. 

3. *diagnosis/ or diagnos*.ti,ab. 

4. *Diagnostic Tests, Routine/ or diagnostic. mp. 

5. (Test or assessment or scale or checklist or instrument). mp. [mp = title, abstract, heading word, table of contents, key 
concepts, original title, tests and measures] 

6. 1 and (or/2^1) and 5 
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